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1  List  of  Numerical  Productivity  Measures 

•  Refereed  papers  submitted  but  not  yet  published:  2 

•  Refereed  papers  published:  7 

•  Un refereed  reports  and  articles;  2 

•  Books  or  parts  thereof  submitted  but  not  yet  published;  0  .  < 

•  Books  or  parts  thereof  published:  0  ^  /  i9S2 

•  Patents  filed  but  not  yet  granted;  0 

•  Patents  granted  (include  software  copyrights);  0 

•  Invited  presentations;  12 

•  Contributed  presentations:  0 

•  Honors  received  :  1  (Drew  McDermott  was  Conference  Chair  of  the  First 
International  Conference  on  AI  Planning  Systems) 

•  Prizes  or  awards  received  (Nobel,  Japan,  Turing,  etc.):  also  include  de¬ 
scriptions  of  the  specific  prizes.  0 

•  Promotions  obtained;  0 

•  Graduate  students  supported  >  25%  of  full  time;  2 

•  Post-docs  supported  >  25%  of  full  time:  0 

•  Minorities  supported  (include  Blacks,  Hispanics,  American  Indians  and 
other  native  Americans  such  cis  Aleuts,  Pacific  Islanders,  etc.;  do  not  in¬ 
clude  Asians  or  Asian-Americans);  0 
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2  Summary  of  Technical  Results 

The  goal  of  our  project  is  to  study  planning  for  autonomous  agents  with  imper- 
Rct  sensors  in  a  dynamic  world.  Such  agents  must  confront  several  problems: 

•  how  to  synchronize  plan  execution  with  plan  refinement 

•  how  to  generate  reasonable  plans  quickly  for  complex  goals,  and  improve 
them  later 

•  how  to  trade  off  sensor-processing  time  against  the  quality  of  information 

•  and  how  to  learn  the  structure  of  the  environment  as  plan  execution  pro¬ 
ceeds. 

We  have  results  in  all  these  areas. 


2.1  Transformational  Planning  of  Reactive  Behavior 

In  complex,  dynamic  worlds,  plans  must  include  sensing  operations,  and  an 
agent  does  not  have  time  to  make  up  such  complex  plans  from  scratch  when  a 
goal  pops  up.  The  alternative  is  to  provide  a  set  of  modular  plans  that  can  cope 
with  most  eventualities,  and  then  pciste  these  modules  together  to  handle  what¬ 
ever  constellation  of  goals  arises.  This  planning  algorithm,  if  you  want  to  call 
it  that,  is  fast,  but  prone  to  producing  inefficient  plans.  Hence,  simultaneously 
with  the  execution  of  the  system’s  default  plan,  an  off-line  planner  attempts 
to  find  an  improved  version.  It  projects  the  plan  to  generate  sample  execution 
traces,  runs  critics  that  look  for  standard  “bugs”  for  the  domain,  and  tries  plan 
transformations  to  attempt  to  ameliorate  those  bugs. 

In  the  past  year,  we  have  developed  several  technical  results  within  this 
framework.  For  example,  when  an  improved  plan  is  found,  the  current  plan  must 
be  abandoned.  Our  model  of  plan  swapping  is  that  the  agent  simply  discards 
the  old  plan  and  begins  the  new  one.  The  new  plan  typically  runs  some  sensing 
operations  to  orient  itself,  and  avoids  repeating  work  done  by  the  old  version. 
However,  this  model  is  prone  to  a  particular  type  of  bug,  where  changes  in 
the  agent’s  belief  state  as  a  result  of  actions  may  not  be  incorporated  because 
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the  plan  is  discarded  between  the  beginning  of  the  action  and  the  recording 
of  the  belief  changes.  We  have  eliminated  this  kind  of  bug  by  addition  of  an 
“evaporation  protection”  primitive  into  our  plan  language.  This  primitive  forces 
some  plan  steps  to  be  carried  out  even  if  the  plan  has  been  discarded. 

It  is  important  when  a  critic  processes  a  plan  projection  that  it  be  able  to 
recover  the  state  of  all  data  structures  at  any  point  in  a  plan.  We  have  developed 
an  efficient  algorithm  for  recording  and  time-stamping  all  side  effects  during  a 
projection.  During  the  criticism  phase,  the  exact  state  of  *he  agent  can  be 
reconstructed  at  any  point.  The  time  and  space  required  are  both  linear  in  the 
number  of  side  effects  to  the  data  structure. 

Reactive  plans  involve  a  high  degree  of  concurrency.  Plans  often  request 
resources,  such  cis  effectors,  and  block  while  they  wait  for  them  to  become  avail¬ 
able.  the  planner  puts  disparate  modules  together,  and  then  transforms 

the  resulting  code,  it  must  cope  with  the  likelihood  that  the  resulting  plans  will 
suffer  from  deadlocks,  in  which  a  cycle  of  tasks  are  waiting  for  each  other  to 
release  resources.  We  have  developed  a  clean  combination  of  the  task  idea  from 
AI  planning,  and  the  process  idea  from  operating  systems.  A  task  is  a  commit¬ 
ment  to  carry  something  out,  and  often  gets  reduced  to  subtasks  that  jointly 
do  the  work.  Some  tasks  correspond  to  processes.  A  subtask’s  process  ought  to 
inherit  resources  from  its  supertasks’  processes,  because  it  corresponds  to  a  part 
of  its  supertasks,  and  will  presumably  advance  them.  Our  process  mechanism 
has  this  feature,  and  supports  an  efficient  mechanism  for  pre-empting  resources 
and  breaking  deadlocks. 

One  of  our  most  important  results  is  our  model  of  declarative  goals  in  reactive 
plans.  A  declarative  goal  is  an  explicit  statement  of  a  condition  to  be  made  true. 
Our  model  assumes  that  tasks  are  procedural,  so  we  must  provide  a  fast  and 
faithful  transition  from  the  declarative  language  to  the  procedural  one.  Our 
method  is  to  provide  for  each  declarative  goal  a  robust  default  method  that  will 
under  normal  circumstances  achieve  that  goal,  possibly  with  less  than  optimal 
efficiency.  For  example,  the  default  method  for  the  goal  Vx(P(x)  Z>  Qi^)) 
is  to  find  an  object  6  that  has  property  P  but  not  property  Q,  and  achieve 
Q{b).  Retrieving  and  instantiating  default  methods  is  quite  fast,  and  gives  the 
agent  something  useful  to  do  while  it  attempts  to  find  a  better  plan  offline. 
Its  search  for  a  better  plan  is  mediated  by  a  set  of  transformations  that  refine 
particular  patterns  of  goals  into  specialized  actions.  For  example,  if  P  mentions 
the  location  of  an  object,  then  there  may  be  specific  methods  for  going  to  a 
location  and  scanning  for  objects  of  particular  sorts.  These  transformation 
rules  are  associated  with  decision-theoretic  heuristics  for  judging  whether  they 
are  likely  to  be  effective  given  the  behavior  of  the  objects  being  manipulated. 

2.2  Interval-Based  Inference  on  Sensor  Data 

Many  sensor-data-process  ng  tasks  can  be  be  phrased  in  terms  of  finding  values 
of  parameters  that  satisfy  given  constraints.  For  example,  determining  whether 
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a  certain  set  of  edges  in  an  image  could  be  an  instance  of  an  object  model  can 
be  thought  of  as  finding  values  for  the  object’s  parameters  that  account  for 
the  given  edges.  Our  approach  to  finding  such  values  is  to  start  with  intervals 
containing  the  correct  values,  and  gradually  prune  away  subintervals  that  are 
inconsistent,  until  we  are  left  with  subintervals  that  are  guaranteed  to  contain 
at  least  one  point  satisfying  the  constraints. 

In  the  last  year,  we  have  made  the  following  advances: 

•  Implemented  a  distributed  version  of  the  algorithms.  The  resulting  im¬ 
plementation  is  significantly  faster  (i.e.  parallel)  and  more  fault-tolerant. 

•  Implemented  a  data  selection  technique  that  reduced  the  time  needed  to 
solve  some  benchmark  problems  to  less  than  1  second  (essentially  real-time 
operation). 

•  Implemented  and  tested  the  algorithms  on  data  requiring  descrimination 
and  comparisons  among  multiple  objects  or  targets. 

•  Implemented  a  version  of  the  algorithms  for  use  in  unstructured  domains. 
This  significantly  increases  the  domain  of  applicability  of  the  algorithms, 
making  it  possible,  for  example,  to  apply  them  outdoors. 

We  have  also  been  using  interval-based  techniques  for  representing  and  con¬ 
structing  maps  of  the  environment  for  mobile  robots.  Maps  are  represented  as 
graphs  cf  places,  where  each  place  is  related  to  its  neighbors  by  its  relative  loca¬ 
tion  (restricted  to  a  set  of  intervals)  and  the  actions  that  will  get  the  agent  from 
one  neighbor  to  another.  In  addition,  stored  with  each  place  is  a  collection  of 
views,  which  are  visual  signatures  obtained  at  that  place.  Two  places  look  alike 
if  their  signatures  are  sufficiently  near.  The  key  problem  in  map  building  is  to 
decide  when  the  agent  is  now  at  a  place  it  has  visited  before.  Our  algorithm 
makes  that  decision  based  on  overlaps  of  the  current  measure  location  and  the 
stored  location  of  the  candidate  matching  location.  An  important  innovation 
is  that  the  algorithm  continues  to  gather  statistics  on  its  identifications,  and 
changes  its  mind  when  it  thinks  it  must  split  previously  merged  places,  merge 
previously  distinctly  places,  or  delete  places  that  were  erroneously  observed  in 
the  first  place. 

Over  the  past  year,  we  have  developed  a  class  of  plans,  called  exploration 
scripts,  for  speeding  up  the  pace  of  map  building;  the  basic  mapping  algorithm 
does  not  require  getting  control  of  the  robot,  but  can  profit  from  an  opportu¬ 
nity  to  do  so.  More  recently,  we  have  gathered  data  on  what  makes  an  image 
signature  distinctive  (less  likely  to  match  lots  of  places).  The  idea  is  to  rotate 
the  robot’s  camera  until  a  distinctiveness  peak  is  attained.  The  hope  is  that  any 
given  location  will  have  few  distinctiveness  peaks,  allowing  us  to  store  only  a 
view  fews.  Preliminary  experiments  seem  to  show  that  this  hope  is  reasonable. 
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2560 

•  Sean  P.  Engelson  and  Drew  McDermott  1992  “Active  Place  Recognition 
Using  Image  Signatures”,  to  appear  in  Proceedings  of  SPIE  Sensor  Fusion 
V,  November  1992 
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•  Gregory  D.  Hager  1991  Towards  geometric  decisioi.  making  in  unstruc¬ 
tured  environments.  In  Proc.  1991  International  Workshop  on  Intelligent 
Robots  and  Systems,  Bellingham,  WA,  pp.  1412-1417. 

•  Gregory  D.  Hager  1992  “A  Constraint- Based  View  of  Selective  Percep¬ 
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•  Gregory  D.  Hager  1992  “Constraint  Solving  Methods  and  Sensor-Based 
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•  Gregory  D.  Hager,  June  1992,  “Sensor-Based  Decision  Making”  presented 
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•  Gregory  D.  Hager  1992  Task-directed  computation  of  qualitative  decisions 
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•  Drew  McDermott,  Nov.  1991,  Invited  presentation  on  “Transformational 
Planning  of  Reactive  Behavior”  at  Ohio  State  University. 
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•  Drew  McDermott,  March  1992,  Talk  at  Carnegie- Mellon  on  “Transforma¬ 
tional  Planning  of  Reactive  Behavior.” 

•  Drew  McDermott,  April  1992,  Talk  at  University  of  Chicago  on  “Building 
and  Fixing  Diktiometric  Maps  for  Robot  Navigation.” 

•  Drew  McDermott,  April  1992,  Talk  at  Northwestern  University,  “Trans¬ 
formational  Planning  of  Reactive  Behavior.” 
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of  Maryland. 

•  Drew  McDermott,  July  1992,  Seminar  on  “Classical  and  Reactive  Plan¬ 
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4  Research  Transitions  And  DoD  Interactions 

Prof.  McDermott  has  been  serving  on  the  Technical  Review  Board  for  the 
DARPA  Transportation  and  Scheduling  Initiative.  The  purpose  of  the  board  is 
to  provide  high-level  feedback  to  researchers  in  this  area,  using  insights  gained 
from  pcist  research  on  planning  and  scheduling. 
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5  Software  And  Hardware  Prototypes 

1.  The  Reactive  Plan  Language  interpreter  is  available,  and  runs  in  Lucid 
Common  Lisp.  We  are  attempting  to  port  it  to  the  University  of  Wash¬ 
ington,  and  other  institutions. 

2.  We  have  recently  finished  a  version  of  the  constraint  solving  algorithms 
that  we  are  giving  to  several  interested  institutions. 
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